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(54) Title: METHOD AND SYSTEM FOR TRACKING NETWORK USE 
(57) Abstract 

A method and system for tracking subscriber use of 
a network, such as an interactive media delivery network, 
which delivers programming to set top boxes coupled to 
a display device is disclosed. The system tracks events, 
including any change in status of a set top box caused 
by a change in programming or channel or a subscriber's 
activation and interaction with a particular interactive 
services application. Each application forms an event 
record comprising the application ID, event and time stamp. 
Collected event records are buffered, compressed, formed 
into packets and transmitted to a merge processor that 
combines event records with content data that describes the 
programming content distributed throughout the network. 
The event records and content data are merged to form 
event timelines for each subscriber's set top box that show 
subscriber activity or programming played to a subscriber 
over a selected time period. 




MERGE METADATA W 2 
WITH CUCKSTRgAM ] ^ 



FILTER RESULTANT DATA 
I 



SEND RAW DATA AND 
RESULTANT DATA TO MKIS I 



I 



1 

I MKIS I 
UIVAL/I 




MKIS DATABASE 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LTJ 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzston 


NO 


Norway 


ZW 


Zimbabwe 


CI 


Cote d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


U 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







4 



WO 98/311 14 PCT/US98/00091 

METHOD AND SYSTEM FOR TRACKING NETWORK USE 

This invention is a method and system for tracking subscriber use of 
network applications, particularly network applications involving delivery of 
5 interactive media or video programming. 

BACKGROUND OF THE INVENTION 

Broadcast and cable television have long dominated the visual media 
market. New communications technologies, however, have accelerated 

10 demand for new types of media such as video on demand, interactive video, 
interactive gaming, home shopping or interactive advertising. Unlike 
broadcast television, viewers of these services typically are paying 
"subscribers," although payments from advertisers also pay a large share of 
the costs of providing these media services. 

15 To gauge the effectiveness of their spending, advertisers have long 

sought information on viewers 1 viewing patterns. A number of devices and 
techniques exist for gathering such information. For instance, U.S. Patent 
Nos. 4,258,386 to Cheung and 4,556,030 to Nickerson, et al., describe the 
general concept of deploying in viewers' homes devices for monitoring a 

20 viewer's television set ("TV") in order to accumulate data illustrating viewing 
habits such as which channels were watched at particular times. 
Accumulated data is then forwarded via telephone lines to a central location 
for analysis. Cheung sends data from particular monitoring stations at a 
preselected, specific "window" of time; interruptions to transmission during 

25 that window result in the Cheung system forwarding the data at another 
time. 

Other systems and methods provide somewhat more use data than 
just channel numbers viewed and time of viewing, Typically, however, the 
information is for a smaller subset of users. Thus, U.S. Patent Nos. 
30 4,816,904 to McKenna, et al., 4,912,552 to Allison, III, et al. and 5,374,951 
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to Welsh, all disclose monitoring "panelist" TV use in order to collect data 
about panelist viewing patterns as well as certain marketing information. 
Generally, panelist monitoring is used to gauge the effectiveness of 
advertising on selected groups of "panelists," each of which is one 
5 household in a group comprising a "panel," typically located in a particular 
geographical area. 

Monitoring not only determines which commercial and TV programs 
the panelist views but also may be used to gather information about which 
products panelists purchase. For instance, the U.S. patent to McKenna 

10 discloses a remote data collection unit located at a panelist home that 

monitors viewer identification data and TV functions (e.g., channel viewed, 
VCR viewing time or game time). Additionally, a wand is provided for 
inputting bar codes of purchased items. Monitored data is sent via the 
telephone network to a central location, which can also download 

15 questionnaires to the panelist and receive responses. Allison and Welsh 
disclose similar monitoring systems and methods. Instead of simply 
monitoring the channel number that a panelist was viewing at a particular 
time, Welsh discloses monitoring identification information carried in the 
television signal vertical blanking interval that identifies preselected 

20 commercials. After detecting and storing the identification information that 
identifies particular commercials viewed by panelists, the data is transmitted 
by telephone to a central location for analysis. 

Monitoring systems also have been used with some early interactive 
media systems. U.S. Patent No. 5,404,393 to Remillard discloses an 

25 interactive TV system. Among other elements of the system, a controller 
monitors TV channels and time/date stamps the selected channel so that, 
indirectly, viewers' programming choices may be monitored. Data is 
assembled into a "user profile," which is uploaded to an appropriate facility 
via the telephone network. 
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Nevertheless, while panelist monitoring systems like those of Allison, 
McKenna and Welsh or interactive television monitoring systems like 
Remillard's provide somewhat more monitoring data than just TV tuning 
data, they do so only for limited groups. For example, when more data is 
5 gathered (like purchase information), it is done only for the panelist groups, 
rather than for subscribers to the entire system. Also, systems like 
McKenna's that uses a wand for scanning bar codes are intrusive systems 
that require user action to collect data rather than collecting data passively 
and automatically. Other systems contemplate capturing only some of the 

10 data generated by subscriber's viewing activities or only some of the ratings 
information. For instance, previous systems typically capture ratings 
information that identify television shows viewed rather than whether the 
subscriber viewed commercials displayed during those shows. 

Perhaps more importantly, none of the systems described attempt to 

15 match "raw" information on channels viewed with programming information. 
Nor do those systems match viewing pattern information with demographics 
information about the particular users in order to provide more "targeted" 
advertising. 

SUMMARY OF THE INVENTION 

20 The present invention uses a collector, associated with a subscriber's 

set top box ("STB"), to obtain data about any "events" - subscriber actions 
or changes in programming ~ that are of interest. Data about virtually any 
events, from channels watched to volume changes to interactive 
applications invoked, may be captured with the collector. Event records 

25 comprising such data, as well as the identity of the application involved and 
the event time, are buffered. Periodically or on command, event records are 
uploaded from the buffer to a merge processor such as through an 
interactive network that allows for duplex communication with the STB. The 
merge processor, which may be a head end server or a workstation 

30 computer forming part of or coupled to the media delivery network, receives 
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(1) the event data and (2) content data that identifies programming content 
broadcast or delivered throughout the region in which the system is 
deployed. Timelines showing particular events over time may then be 
generated for each subscriber. Rather than just determining the channel 
5 viewed and time of day, the event timelines describe the programming or 
interactive applications selected by or shown to a subscriber over a selected 
period of time (e.g., 24 hours). 

The merge processor may further filter this collected and merged data 
to generate reports ranging from descriptions of a single user's viewing 

10 patterns to very high level viewing patterns showing the number of users 
who watched or participated in a particular program for a selected time 
period. Further, that information can be combined with billing and 
demographics information to provide detailed information on a particular 
subscriber's or group of subscribers' viewing and related buying patterns. 

15 The present invention thus involves a method for obtaining detailed 

information on every application invoked by a subscriber and information 
about the type of programming shown. The first step is to identify data that 
describe the events of interest that occur. Those events include: the 
channel viewed, a switch to another channel, a passive change in 

20 programming because of a commercial break or change to a new program, 
use of a VCR or other ancillary device, or invocation of an interactive 
application and subscriber commands given to the system during the 
application. Event data also includes start and stop times, identification of 
the subscriber's STB or specific data needed to be recorded for any 

25 particular interactive or other application. 

Event records are formed from this collected data and buffered before 
uploading through the interactive or other media delivery network to a 
headend, server or third party data analysis system. Before uploading, the 
captured data may be compressed and formed into packets for 

30 transmission. 
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Using the system or method of the present invention allows service 
providers to obtain ratings information and detailed information on 
subscriber viewing patterns and subscriber use of interactive applications. 
Thus, the present invention can track the number of subscribers viewing or 
5 watching particular programs, including advertisements. It also can track 
use of particular interactive applications such as video on demand. The 
invention automatically matches data describing programming content with 
event data describing a channel or application activated or controlled by the 
subscriber. This allows the invention comprehensively to track user 

10 "channel surfing." Also, the invention can compare subscriber 

demographics or billing information with viewing pattern information in order 
to tailor commercials to those subscribers; determine whether subscribers 
with a selected demographic background viewed a commercial of interest; or 
determine the demographics of subscribers that viewed selected 

15 commercials. 

Persons skilled in the art will recognize that the present invention may 
be used with numerous types of networked media delivery systems. For 
instance, the method or system of the present invention can be deployed on 
an interactive media delivery system or modified for use with a conventional 

20 cable television network, a wireless cable television network, or a home 
satellite television network. 

It is accordingly an object of the present invention to provide a system 
and method for collecting information about patterns of subscriber viewing 
and use of a media delivery system. 

25 It is another object of the present invention to provide a system and 

method for determining which network applications, particularly interactive 
applications, are invoked by particular subscribers. 

It is an additional object of the invention to provide a system and 
method for communicating collected information to a merge processor. 
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It is a further object of the invention to provide to the merge processor 
information about the programming content distributed over the media 
delivery system. 

It is yet another object of the invention to provide a system and 
5 method for merging the collected information with the programming 
information in order to obtain comprehensive information about 
programming shown to or network applications invoked by subscribers. 
Other objects, features and advantages of the present invention will 
become apparent upon reading the rest of this document. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a block diagram of elements of one embodiment of 
the system of the present invention. 

Figure 2 shows a block diagram of a Set Top Box as used with the 
15 embodiment of the present invention shown in Figure 1 and provided with 
a clickstream processor. 

Figure 3 shows a schematic diagram showing the upload cycle for 
collected event data. 

Figures 4A and 4B show the upload of collected event data from a 
20 selected Set Top Box through the network to the staging server shown in 
Figures 1 and 5. 

Figure 5 shows an overview of the staging server, its functions and 
its interconnections with various data sources. 

Figure 6A shows the system elements required for merging and 
25 parsing the event and content data collected by the present invention. 

Figure 6B shows the assignment of priority to content data 
necessary for completing the merge and parse process. 

Figure 7 shows the results of a merge and parse process of the 
present invention. 

30 
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DETAILED DESCRIPTION OF THE DRAWINGS 
System Overview 

Figure 1 shows a block diagram of the components of the system 20. 
System 20 is a demographics and programming ratings collection and 
5 analysis system that may be deployed for use on an interactive media 

delivery system such as the Interactive Video Services Network deployed by 
BellSouth Interactive Media Services. That interactive system is described 
in co-pending application S/N 08/428,718, assigned to the assignee of the 
present invention and which document is hereby incorporated in its entirety 

1 0 by this reference. However, persons skilled in the art will recognize that the 
present invention may be used with any of a variety of interactive media 
delivery systems, standard or wireless cable television systems, satellite 
television systems or other media delivery systems that allow duplex 
communication (perhaps with the return path via a separate (e.g., telephone) 

1 5 network) to a set top box ("STB") 30 coupled to a subscriber's display 
device, such as a television set or the like. 

In any event, Figure 1 shows various system 20 elements and 
subsystems that communicate with each other to transmit collected 
information, data error detection schemes and data acknowledgments. 

20 Briefly, the STB 30 communicates through a distribution network 52 with a 
video server 60, such as a video transfer engine ("VTE"), that may be 
acquired from Hewlett Packard ("HP"), with a video/object storage database 
54. Video server 60 couples to a video control server 56, such as an Inter 
Media Server available from Sybase and deployed on a platform such as an 

25 HP 9000, with a database 58. The video server control 56 controls video 
server 60 and also logs information about video server 60 use. A staging 
server 70 receives collected records of events of interest. These "event 
records" pass through the video server control 56, which also couples to a 
Marketing and Information System ("MKIS") 100 that couples to staging 

30 server 60, which receives (1) the event records and (2) content data from 
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various sources 120, 140 and 160 identified in Figure 1 and which describe 
programming content available through the interactive network to all 
subscribers. MKIS 100 may be coupled to a third party search and analysis 
system 110 that can provide customer support operations. 
5 STB 30 provides a platform by which (1) content is converted to a 

selected video format (e.g., NTSC or PAL) and presented to the subscriber 
or (2), for interactive systems, messages are exchanged (including video 
data) over a network 52 with the staging server 70. STB 30 also could 
include platforms capable of: (1) receiving messages from a user input 

10 device, such as a hand-held remote control unit; (2) translating video signals 
from a network-native format into a format that can be used by the television 
or display device; (3) inserting alphanumeric or graphical information into the 
video stream in order to "overlay" that information on the video image; (4) 
providing graphic or audio feedback to a user; or (5) possibly the most basic 

15 function, simply routing a traditional broadcast signal to a viewing device 
connected to the STB 30. Analogous terms to STB include: Set-Top 
Terminal ("STT"), Cable Converter, and Home Communications Terminal 
("HCT") and any of these devices may be coupled to or made a part of a 
display device for showing programming to subscribers. Generally, STB 30 

20 may be a Richmond or 8600X available from Scientific Atlanta, a CFT 2200 
available from General Instruments, Thomson's DSS or any other device 
equipped with (1) a microprocessor; (2) memory for operating instructions 
and storage; and (3) a control interface for accepting subscriber commands 
from a remote control device or control panel. 

25 For the particular embodiment of system 20 shown in the Figures, 

collected event records that are packaged for transport through system 20 
are called "clickstream" data or information. Figure 2 shows a clickstream 
processor 34 that resides in the memory, such as DRAM or the like, of an 
STB 30 and which has a clickstream kernel 36, buffers 42 or 44, a 



-8- 



WO 98/31114 PCT/US98/00091 

clickstream upload handler 40, a clickstream controller 38 and a clickstream 
event API (application programming interface) 41. 

Briefly, the clickstream kernel 36 buffers events passed to it by 
various network applications through the clickstream event API 41. 
5 Clickstream controller 38 accepts control messages from staging server 70 
and appropriately stores their pay load. Typical messages may be sent over 
the Extended Super Frame ("ESF") pass-through data link and control the 
uploading of clickstream data. Clickstream upload handler 40 accepts 
control messages over the system 20, which messages control the 
10 uploading of collected clickstream data over the reverse path through 

network 52. Also, the clickstream upload handler 40 stores the payload of 
these messages in appropriate and available memory and accepts the 
messages sent to it to acknowledge the receipt of uploaded clickstream 
data. 

15 Referring again to Figure 1 , video server 60 provides information from 

video/object storage 54 to the particular interactive system over which 
system 20 is deployed. Clickstream data collected at STBs 30 can be 
uploaded to staging server 70 in any number of ways. For instance, Figure 1 
shows that the distribution network 52 could couple directly to staging server 

20 70, allowing clickstream data packets sent from STBs 30 to be forwarded to 
staging server 70 directly and for staging server 70 to then return via the 
network 52 data acknowledgments. A network management controller 50 
controls the flow of information through the network 52. Alternatively, 
Figure 1 and, in greater detail, Figure 4B, show that clickstream data packets 

25 may be sent to the distribution network 52 to the video server 60. Video 
server 60 passes through both clickstream data uploads from various STBs 
30 and data acknowledgments returned to the STBs 30. A communications 
router inside the video server 60 redirects traffic to the appropriate 
destination. Video server control 56 similarly acts as a pass-through device 

30 for STB 30 clickstream data going to the staging server 70 and as a pass- 
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through device for staging server 70 data acknowledgments to the STBs 30. 
Also, video server control 56 may provide log information that identifies 
interactive applications invoked by particular STBs 30. That log information 
is provided to the staging server 70 so that video server control 56 also acts 
5 as another data source about content available over the network, like EPG 
metadata source 120, broadcast advertising metadata source 140, or 
advertising traffic control metadata source 160. Staging server 70 collects 
all such clickstream data and content data, analyzes and then forwards it to 
MKIS database 100 or to a third-party analysis engine and database 1 10, as 

10 described in more detail in the text associated with Figures 5-7. 

Journalinq of Event Data 
Clickstream processor 34 collects information to create a "journal" or 
log about all events or selected events of interest. An event is an action or a 
change in the state of a STB 30 that is deemed important to building a 

15 knowledge base on subscribers or their viewing patterns. For example, an 
event can include key presses to change channels or volume, mute, to enter 
the navigator for the interactive system, to turn the STB 30 off or on, to fast 
forward, to pause or to rewind a video obtained via the video on demand 
application. The events include applications called by the subscriber, such 

20 as interactive gaming applications, an electronic program guide, a video on 
demand or near video on demand application, a home-shopping application 
or a particular company's interactive application, such as The Weather 
Channel's weather on demand, World Span's travel on demand or Light 
Span's educational interactive application. Events include subscriber use of 

25 and control commands to peripheral devices coupled to the STB 30 or a 
subscriber's display device, such as a VCR or videodisk player. 

Each application residing on the STB 30 interfaces with the 
clickstream processor 34 to send selected data for maintaining a desired 
journal. Assuming that the system 20 is used with an interactive system, 

30 many different applications may be deployed over that system and may be 
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triggered by the subscriber. Some fairly typical applications that might be 
invoked include: 

• a cable television application that handles subscriber remote 
controls (like channel or volume changes); 

5 • an electronic programming guide application such as TV Data, 

Prevue or Star Sight interactive services; 

• an interactive game; 

• a video on demand or near video on demand application; 

• company specific applications, that might be offered by content 
1 0 provider such as the Weather Channel, MTV, Showtime, etc.; or 

• a navigator application to help the user choose options. 
Each of these applications, as well as some internal applications that the 
system 20 may wish to monitor, will be assigned a unique application 
identifier. 

1 5 Clickstream processor 34 interfaces with the various applications 

resident in the STB 30's operating system 32 and any third party applications 
33. Note that for systems using other types of STB 30's than the 
embodiment described in the Figures, those STB 30's need not have an 
operating system. Instead, all instructions can be written directly to the 

20 memories of those particular STBs. Applications 33 can be added by either 
downloading entirely new software directly to memory or by downloading 
new tables as described below. 

When an application 33 reaches a point where an "event" of interest 
has been generated, the application 33 stores an event record to memory. 

25 The application 33 then launches to the clickstream kernel 36 the event 
record, including information such as: (1) the application's 33 identification 
code (e.g., the "Cable Television Application" or a particular interactive 
application); (2) a count of the amount of information (number of bytes) to be 
journaled; (3) a "time stamp" that defines a unique point in time, e.g., by 

30 defining the date and time of day, accurate to the hour, minute or second; (4) 
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an identification code for the event, or (5) where the event data was stored. 
Clickstream kernel 36 uses the information provided by the applications 33 
to collect the event data, format it and place it into a buffer 42 or 44. Table I 
shows the type of information that will be generally sent by the clickstream 
5 processor 34 to the buffers 42 or 44. 



Table 1: Application Event Record 


Size 


Timestamp 


6 bytes 


Assigned Application ID 


16 bits 


Number Bytes to Follow (length) 


8 bits 


Application Specific Data with customized 
formats and lengths 


Multiple Bytes 



Global table II defines events of interest that each application can 
identify, collect, store in the "Application Specific Data" field and notify the 

10 clickstream kernel 36. These events could be as simple as a broadcast 
channel change by pressing the "Chan Up" remote key. All of these event 
types can be accessed and used by each application. While each 
application may not use every possible event type, the number of events 
available for collection allows system 20 to extract any pertinent usage 

15 information for analysis. Also, the use of the global table II increases 
system 20 efficiency because event types can be modified, added or 
removed. 



Table II: EVENT DEFINITIONS 


Code Event 


Content Related Events 


0x0000 


Passive Content 




Change 


Direct Key Presses 


0x0001 


TV<> nv 




Pressed 



Table II: EVENT DEFINITIONS 


Code 


Event 


Application/State Switching 
Related 


0x0028 


AC Power ON 


0x0029 


Application 
Switch (Normal) 


0x002A 


Application 

Switch 

(Abnormal) 
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0x0002 


Power Pressed 


0x0003 


One (1) Pressed 


0x0004 


Two (2) Pressed 


0x0005 


Three (3) 
Pressed 


0x0006 


Four (4) Pressed 


0x0007 


Five (5) Pressed 


0x0008 


Six (6) Pressed 


0x0009 


Seven (7) 
Pressed 


OxOOOA 


Eight (8) 
Pressed 


OxOOOB 


Nine (9) Pressed 


OxOOOC 


Zero (0) Pressed 


OxOOOD 


Channel Up 
Pressed 


OxOOOE 


Channel Down 
Pressed 


OxOOOF 


Volume Up 
Pressed 


0x0010 


Volume Down 
Pressed 


0x0011 


Last Channel 
Pressed 



0x002B 


Application 
Terminated 
(Normal) 


0x002C 


Application 
Terminated 
(Abnormal) 


0xO02D 


Soft Power OFF 


0x002E 


Soft Power ON 


0x002F 


OFF State 
Polling Event 


General 


0x0030 


Direct Channel 
Change 


0x0031 


Mute 


0x0032 


Un-Mute 


0x0033 


Volume Change 
Below 50% 


0x0034 


Volume Change 
Below 25% 


0x0035 


Volume Change 
Below 10% 


0x0036 


Volume Change 
Above 50% 


0x0037 


Volume Change 
Above 25% 


0x0038 


Volume Change 
Above 10% 


0x0039 


Change to 
Interactive Mode 


0x003A 


Change to 
Broadcast Mode 



Note that Table II defines relative volume changes (e.g. "volume 
change below 50% t " "volume change below 25%," etc.). Although the 
applications could capture the actual key presses that lead to these relative 
volume changes, that level of detailed information is of little use to system 
20 operators. Also, capturing all that detail leads to more records and 
higher demands upon the transmission network 52 when those records are 
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uploaded. Applications could also be configured to "filter" other unwanted 
details about other subscriber activities. For example, when subscribers 
"channel surf" by quickly flipping through a number of channels in a short 
period of time, the application could be configured not to record channel 
5 changes unless the subscriber paused for greater than a certain selected 
time period (e.g., 15 to 30 seconds). Again, this eliminates information of 
little use and decreases network traffic. 

Table III defines a small portion of a sample global channel 
identification table that proposes codes for identifying national and local 

10 broadcasters. Such a table allows any application journaling events which 
occur while subscribers are viewing broadcast or cable television programs 
to identify the network carrying the programming content by using a subset 
of the global table II. In this way channel lineups can be changed yet the , 
identifier for a broadcast or cable network would stay the same. The use of 

15 this mapping scheme eliminates the need to map an ever-changing channel 
number to a network. 



Table III: Broai 
Identif 


rtcast Channel 
cation 


0x0100 to 




0x01 1F 


News/Talk 
Shows 


0x0100 


CNN 


0x0101 


Headline News 


0x0102 


The Weather 
Channel 


0x0103 


CNBC 


0x0104 


CSPAN 


0x0105 


CSPAN-2 


0x0106 


America's 
Talking 


0x0107 


Talk Channel 


0x0108 


Court TV 


0x0109 


The Crime 



Table III: Broadcast Channel 


Identification 


0x0120 to 




0x01 3F 


Sports 


0x0120 


ESPN 


0x0121 


ESPN-2 


0x0122 


SportSouth 


0x0123 


The Golf 




Channel 


0x0124 


Classic Sports 




Network 


0x0125 


Prime 




Network 


0x0126 


NewSport 


0x0140 to 




0x01 5F 


Music 


0x0140 


MTV 
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Channel 


0x010A 


National 

Empowerment 

TV 







0x0141 


VH-1 


0x0142 


Country Music 
Television 


0x0143 


The Nashville 
Network 


0x0144 


The Box 


0x0145 


Video 
Jukebox 


0x0146 


MOR Music 
TV 


0x0147 


Music Choice 



10 



Table IV below shows some possible identification codes for particular 
applications. Note that each application could be programmed to insert its 
application ID code into the event record without accessing table IV. But by 
having each application access the table IV during the journaling process, 
the system's 20 ability to modify or add application ID codes easily is 
enhanced because such codes could be populated across system 20 by 
downloading an updated table IV Providing for downloading of new tables 
increases the application footprint and system 20 complexity so tables can 
also be part of the application programming. 



Table IV: Application identifiers 


ID Code 


Content 


0x0000 


Operating System 


0x0001 -F 


Operating System Sub-Systems 


0x0010 


Application Manager 


0x001 1 


Cable Television Application 


0x0012 


Clickstream Kernel 


0x0100 


EPG System 


0x0101 


Digital Pictures - Interactive Game 
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0x01 10-F 


Viacom - MTV / Showtime, etc. 


0x1000 


Interplay Written Applications General ID 


0x1001 


Interplay Runtime Engine 


0x1002 


Interplay Navigator 


0x1003 


Interplay VOD 


0x1004 


Interplay NVOD 


0x1005 


Interplay TownGuide 


0x1100 


The Weather Channel, Weather On-Demand 


0x1101 


Worldspan - Travel On-Demand 


0x1102 


Lightspan - Educational Interactive Application 


OxFFFF 


Missed Events Record 



Each particular application can simply reference the global 
application, event and channel identification tables (which periodically may 
be updated and then downloaded to STBs 30) in order to build an event 
5 record. Examples of application specific event records that may be created 
in this manner are shown in Tables V through VIII below and discussed in 
associated text. 

A cable TV application 33 may tune analog or digital broadcast 
services. When a command to change channels is entered, the cable TV 

10 application 33 is invoked. The cable TV application 33 begins building an 
event record by inserting an application ID and time stamp into the record. 
Next, the application 33 determines the "event ID" by cross-referencing the 
command with the global event ID table II for the proper code. Then, the 
application 33 journals the "Channel ID." 

1 5 Although the Channel ID could simply be the number of the channel, 

that information means little. The fact that channel 6 was watched more 
than channel 7 has little or no meaning unless networks and, ultimately, the 
content delivered by those networks are associated with particular channels. 
Accordingly, the Channel ID may be a field, like a 16 bit field, which uniquely 
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MISSING UPON TIME OF PUBLICATION 
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MISSING UPON TIME OF PUBLICATION 
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Rewind 


0x03 


Info (Help) Video or Screen Played 


0x04 


reserved 


0x05 


reserved 


0x06 


reserved 


0x07 



Table Vill below shows the event record for the Electronic Program 
Guide (EPG) application 33. The EPG application 33 records the application 
ID, timestamp and event ID records just as do the above applications 
described in tables V-VII. Additionally, it has an application 33 state ID field 
that identifies which of the display screens were accessed by subscribers, as 
shown below. 



Table VIII: Electronic Program Guide (EPG) 
Application Event Record 


Size/ Data 


Application ID: See Application ID table IV 


16 bits 


Timestamp: Identifies event time 


6 bytes 


Event ID: See Global Event ID table for Syntax 


16 bits 


Application State ID: See below for information 
tracked: 


8 bits 


Initial Display Screen 


0x00 


Look Ahead Display 4 Hour 


0x01 


Look Ahead Display 8 Hour 


0x02 


Look Ahead Display 12 Hour 


0x03 


Look Ahead Display 16 Hour 


0x04 


Look Ahead Display 20 Hour 


0x05 


Look Ahead Display 24 Hour 


0x06 


Reserved 


0x07 



Generally, similar information about other applications 33, such as 
home shopping, interactive gaming or any other new applications deployed 
over an interactive or other media delivery system, can be tracked in a 
similar fashion. Additionally, the journaling process may be used to track 
errors within the system 20, with clickstream kernel 36 journaling such errors 
using the same method as described above. 
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Over time, the journaling needs of system 20, or system 20 itself may 
evolve. Applications may be changed or new ones deployed. New events 
may become of interest to the operator of system 20. In order to provide 
flexibility for system 20, operators may download to STBs 30 new or 
5 replacement applications that will include the necessary processes for 
journaling all events of interest. 

Sample Journal 

Assume that Mr. Smith turns on his interactive television at 7:30 p.m. 
to watch a half hour news program on channel 5, which corresponds to CNN 

1 0 for that region. At 8:00 p.m. he accesses the Navigator application to order 
a video through the video on demand application. He then accesses the 
Video on Demand application, which automatically begins playing a video at 
8:04, pauses the video at 8:50 and begins playing again at 8:55 until it is 
completed at 9:45, at which point he turns off his interactive TV. 

15 Mr. Smith's activities generate the following event records shown in 

table IX below (for convenience, multiple events occurring under a single 
application are grouped even though separate records are created in 
operation): 



Table IX: Sample Event Records 


Cable Application Event Record 1 Content 


Data 


Application ID: See table IV for application ID Code 


0x0011 


Timestamp: Identifies event time 


1/1/96 7:30:01 
p.m. 


Event ID: See Global Event ID table II to retrieve code 
for "power pressed" 


0x002 


Cable Application Event Record 2 and 3 Content 


Data 


Application ID: See table IV for application ID Code 


0x0011 


Timestamp: Identifies event time (Date will be same for 
second entry) 


1/1/96 7:30:03 
p.m.; 8:00:01 
p.m. 


Event ID: See (1) global Broadcast Channel ID table III 
to determine that Channel 5 is CNN and locate code and 
(2) table II for an event ID code corresponding to an "iTV 
Press" by Mr. Smith. 


0x0100 
0x0001 


Navigator Application Event Record 4 Content 


Data 


Application ID: See table IV for application ID Code 


0x1002 



-20- 



WO 98/31114 PCT/US98/00091 



Timestamp: Identifies event time for accessing each 
screen. 


1/1/96 

8:01:30 p.m. 


Event IDs: See table II for event ID code that identifies 
an "enter" command by Mr. Smith to invoke this 
application. 


0x0021 


Application State ID Code: This shows Mr. Smith 
accessed the Main Menu 


0x01 


Navigator Application Event Records 5-6 Content 


Data 


Application ID: See table IV for application ID Code 


0x1002 


Timestamp: Identifies event time for accessing each 
screen. A separate record is created for each activity, 
with a timestamp showing initiation of each activity. 
Each record will have the corresponding event and state. 


1/1/96 

8:02:00 p.m.; 
8:03:00 p.m.; 


Event IDs: See table II for event ID code that identifies 
an "enter" command by Mr. Smith to invoke this 
application. 


0x0021 
0x0021 


Application State ID Codes: This shows Mr. Smith 
accessed the Movies Sub-Menu and Movie Sub-menu 
list. 


0x03 
0x05 


Video on Demand Application Event Records 7-9 
Content 


Data 


Application ID: See Application ID table IV (same for 
each record). 


0x1003 


Timestamp: Identifies event time for each event recorded 
by the application. (The day is the same for each record 
and each time corresponds with the activity identified 
below). 


1/1/96 

8:04:00 p.m. 
8:50:00 p.m. 
8:55:00 p.m. 


Event ID: See table II for event ID codes that identify Mr. 
Smith's play, pause and play commands. 


0x0022 
0x0024 
0x0022 


Application State ID Codes: These show Mr. Smith 
activated this application, played, paused and then 
played again his selected video. 


0x00 
0x01 
0x00 


Cable Application Event Record 10 Content 


Data 


Application ID: See table IV for application ID Code 


0x0011 


Timestamp: Identifies event time 


1/1/96 9:45:00 
p.m. 


Event ID: See Global Event ID table II to retrieve code 
for "power pressed" 


0x002 



Event Record Upload Cycle 



The variably sized event records are collected and then stored in one 
of two clickstream buffers 42 or 44. Capacity of each of the buffers may be 
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statically provisioned or the system 20 may addressably download to 
particular STBs 30 an appropriate buffer 42 or 44 size. A buffer 42 or 44 
may be an allocated, contiguous free area of STBs 1 30 memory set aside for 
buffering event records only. Although advanced database techniques like 
5 link lists or record pointers could be used, they would increase the 

application footprint and complexity. Because buffer sizes of about 15 kB 
would probably accommodate the journaling needs of most applications, 
advanced database techniques need only be used for larger buffers. Buffers 
up to 1 5 kB should allow at least 4 to 8 hours of peak channel "surfing" 

1 0 between uploads (channel surfing typically will generate the most event 
records). In any event, empirical analysis of network use should determine 
the optimum buffer size. 

Event records are directed to one of the two buffers 42 or 44, 
although a single or even more buffers could be used with the system 20. 

15 Conceivably, the system 20 could also be modified to upload event records 
in real time; however, that severely increases the possibility of instantaneous 
overloads in network traffic. Thus, system 20 preferably uses buffers 42 or 
44 to buffer collected event records until they are upload. 

Event records from a particular STB 30 may be uploaded in a format 

20 that assists in their transmission back through the distribution network 52 to 
the staging server 70. A header record may indicate the time the buffer 42 
or 44 was first opened, the number of bytes in the buffer 42 or 44, the 
originating STB 30 by address, the version of the clickstream kernel 36 
which generated the record and the type of data compression used on the 

25 following data (if any). This first header record may be of fixed length and 
uncompressed. Information following "Compression Type" may be 
compressed to save in transmission bandwidth. Table X below shows this 
general header format: 



-22- 



WO 98/31114 PCT/US98/00091 



Table X: Buffer Header Record 


Size 


Transaction Code 


8 bits 


Clickstream Version Number 


B bits 


Timestamp 


6 bytes 


Number of Bytes 


8 bits 


STB Unique Address Most Significant 


16 bits 


STB Unique Address Least Significant 


32 bits 


Compression Type 


8 bits 



When (1) a buffer 42 or 44 fills, (2) an upload timer event expires or 
(3) upon command from the staging server 70, the clickstream processor 34 
initiates an upload process. During that process the uploading buffer 42 is 
locked and subsequent event records are routed to and stored in the second 
5 buffer 44. When upload of buffer 42 is completed, records continue to buffer 
44 until the next upload time, after which buffer 44 locks and records go to 
buffer 42. This cycle continues to repeat. 

Figure 3 shows an upload cycle diagram illustrating one method of 
evenly distributing increased traffic on the network 52 caused by upload of 

10 event records. The clickstream upload cycle consists of several parameters 
that define a start time and a cycle over which the uploading of data occurs. 
The "first occurrence" parameter defines a starting time in history from which 
the cycle runs. The "cycle time" parameter defines the amount of time which 
elapses between periods of the upload cycle. When a cycle is complete the 

15 "upload duration" time starts, and the clickstream processor 34 of each STB 
30 will randomize an exact upload time within the upload duration. This 
timing of upgrades will distribute the network load evenly over the entire 
upload duration period. 

An example of the use of these parameters would be to define a 

20 period of time every day for STBs 30 within system 20 to upload data. 

Typically, the system 20 operator may want the data available every morning 
for analysis. Peak use of broadcast prime time or of interactive services will 
typically be from 7 p.m. until 12 p.m., during which time no uploads should 
occur in order to minimize the burden on the network 52. Beginning at 12 
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p.m., uploads of event records out of a buffer 42 or 44 would begin. In order 
to have all STBs 30 upload before 8 a.m., the STBs 30 may be divided into 
upload groups, e.g., 32, with each group uploading over a selected (e.g., 15 
minute) period. To achieve this upload cycle, the following parameters are 
defined in the Figure 3 cycle in table XI: 



Table XI: Upload Cycle Parameters 


Parameter 


Definition 


Cycle_First_Occurance_Start_Time 


12:00am Jan 1, 1995 + "X"*15 
minutes. "X" staggers each upload 
group by 15 minutes; X=number of 
Groups 


Cycle Time 


24 hours 


Upload Duration 


15 minutes 



A total of four upload cycles will be defined for each group of STBs 30, which 
allows for weekly uploads or any other combination of cycles to work around 
peak network 52 load times. 

STBs 30 can be instructed as to their role in uploading by sending 
from staging server 70 appropriate commands that are handled by the 
clickstream upload controller 38. For instance, the following commands may 
be addressed and sent by staging server 70 to a single or group of STBs 30. 



Table XII: Clickstream Upload Control Commands 


Octet# 


Contents 


TO 


Transaction Code MSB = 0x80 


T1 


Transaction Code LSB =0x10 


0 


Clickstream Processor Version Number 


1 


Global 

Flag 

(b1) 


Addressable 

Flag 

(b1) 


Group Address - Denotes the group 
of STBs to field this 1 
transaction (b5) 


2 


Collection On/Off Key Will turn clickstream collection On/Off 

to a STB or Group of STBs 
(non-Gobal only) 


3 


Perform Upload Now Key Will perform an upload on command. 

Will only upload on command if Global 
Flag is NOT set. 
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4, 


Suppress Uploand on Buffer Will keep the STB or Group from 
Ful uploading when buffer is full. The 

STB or Group will only upload on it's 
appointed upload cycle. 


5. 


Upioad_Cycle_Definition A STB will have 1 to 4 possible upload 

cycles defined. This will define any 
one of those cycles. 


6. 


Cycle First Occurrence Start Year (b8) Defines the time for first 
Time (Total b48) upload in a cycle. 


7 


Cycle First Occurrence Start Time - Month (b8) 


8 


Cycle First Occurrence Start Time - Day (b8) 


9 


Cycle First Occurrence Start Time - Hour (b8) 


A 


Cycle First Occurrence Start Time - Minute (b8) 


B 


Cycle First Occurrence Start Time - Second (b8) 


C 


Upload Duration 


(Total b24) - Hours(0-24) (b8) Defines a 
duration of time over which the 
STB randomizes upload start 
time. 


D 


Upload Duration 


- Minutes(0-59) (b8) 


E 


Upload Duration 


- Seconds(0-59) (b8) 


F 


Cycle Time 


(Total b32) - Days (0-14) (b8) Defines the 
periodicity (mean time) between 
uploads. 


10 


Cycle Time 


- Hours(0-24) (b8) 


11 


Cycle Time 


- Minutes(0-59) (b8) 


12 


Cycle Time 


- Seconds(0-59) (b8) 



Depending on how the system is configured, the commands instruct STBs 
30 to: 1) define the cyclic upload for various groups of or even all STBs 30; 
2) require STBs 30 to upload on command/polling control (addressable 
5 only); 3) suppress upload when a buffer 42 or 44 fills; or 4) turn on/off event 
record collection by particular or groups of STBs 30. 

Event Record Formatting. Upload and Capture 
After the upload process triggers, each STB 30 typically initiates 
upload by first locking the buffer 42 or 44 to be uploaded and then 
1 0 compressing the contents of that buffer 42 or 44. A number of different 
compression techniques may be used, however, about 50% compression 
may be achieved with LZW compression utilities. Such compression 
substantially reduces the load on network 52 caused by numerous STBs 30 
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uploading event records. Compressed data is divided into transmission 
"transactions" or "packets" and packet headers are addressed to indicate 
packet identification, IP destination address, etc. The actual network 
connection can be initiated by the operating system for the particular STB 
5 30. Persons skilled in the art will recognize that the type of and manner of 
invoking and implementing the network connection will vary depending upon 
the type of media delivery network over which system 20 is deployed. 

For instance, the STB 30 can be configured to insert UDP/IP headers 
and trailers taken from the RFC 791 or RFC 768 specifications published by 
10 the ISO. Each data packet may have UDP/IP protocol built around a Level 1 
pass-through header, such as shown in Table XIII below: 



Table XIII: UDP/IP Protocol for Headers 


IP Header 


IP Version Header Type of 
Length Service 


Total Length 


Identification 


Flags Fragment Offset 


Time to Live Protocol 


Header Checksum 


Source IP Address 


Destination IP Address 


UDP Header 


Source Port 


Destination Port 


Length 


Checksum 



In the embodiment shown in the Figures, the clickstream processor 
15 34 will identify a particular VSP - Video Sen/ice Provider, which is an entity 
connecting to network 52 to distribute services -- like VSP 66 shown in 
Figure 4B, as the destination of these data packets. All of the data to be 
uploaded appears as "payload" to the STB 30, the signaling network 52, the 
network management controller 50, and the event capture process 71 on the 
20 staging server 70. After an appropriate header and trailer inserted at the 
STB 30, the upload data packet may have the format shown in Table XIV: 
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Table XIV: Clickstream Upload Data Packet 


Octet# 


Contents 


TO 


Transaction Code MSB = 0x80 


T1 


Transaction Code LSB =0x18 


0 


Clickstream Processor Version Number 


1 


Upload Sequence Number 


0x02 

thr. 

OxFA 


Clickstream Upload Buffer Data Structure (as shown in Table 
I and X). The data may be broken up into as many reverse 
path transactions as necessary to complete data upload. 



Providing two buffers 42, 44 allows event record collection to continue 
during upload. Assuming buffer 42 is being uploaded, if the second buffer 
5 44 fj||s during the upload process, a buffer overrun condition occurs. To 
account for such an occurrence, the buffer trailer record sent during upload 
from STBs 30 may denote such an error condition. The structure of the 
buffer trailer record may take the form as shown in Table XV below and 
include a time stamp, assigned application identification, length and upload 
10 code. 



Table XV: Buffer Trailer Record 


Size 


Timestamp 


6 bytes 


Assigned Application ID 


16 bits 


Number Bytes to Follow (length) 


8 bits 


Upload Status Code 


8 bits 



These upload status codes identify the stage of the upload process at 
the time a buffer 42 or 44 overflow occurred. Thus, some possible upload 

15 codes could include: upload not used, upload in progress, upload completed 
but no acknowledgment received, upload completed but only partial 
acknowledgment received or no upload attempted. This will let the staging 
server 70 know that STB 30 event records are missing beginning at that 
time. Also, receiving a buffer overrun record informs the staging server 70 

20 that buffer 42 or 44 sizes have not been set appropriately. Buffer 42 or 44 
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sizes can then be reset and released to the system 20 as an update or 
released to a particular STB 30 by sending it an appropriate command. 

Note that the packetization description above is for one embodiment 
of the system 20. However, generally, to upload collected event records, 
5 STBs 30 can initiate whatever "upstream" data transmission process used 
by the interactive, cable television or other media delivery system with which 
the system 20 is used. That process will upload the event records in the 
appropriate system format. 

In any event, for system 20, clickstream data packets are uploaded to 
1 0 the staging server 70 over a slotted-ALOHA (a contention-based standard 
transport protocol) data transmitter of the STB 30. Data acknowledgments 
from the staging server 70 are sent; each is addressed to particular STBs 
30. The frequency and period of data acknowledgments may be determined 
by considering network error rates, network packet error rates and causes of 
1 5 those types transmission errors. 

Figures 4A and 4B show in greater detail the clickstream data flow 
through the system 20. Briefly, Figure 4A shows that clickstream packets of 
event records are transmitted from each STB 30 to the network management 
controller 50, which acts as a video service provider router. From the 
20 network management controller 50, which manages traffic over network 52, 
packets are forwarded via the network 52, video server 60 and video server 
control 56 to the staging server 70, which couples to MKIS 100 and analysis 
engine 110. Thus, event records collected and buffered at STBs 30 are 
transmitted to the staging server 70 for collection and analysis. 
25 Figure 4B shows this process in more detail and also describes an 

event records capture process 71 at staging server 70. 

As noted, once a buffer 42 or 44 fills or the clickstream processor 34 
decides to upload data for other reasons (time expiration, low system 
utilization, commanded upload, etc.), the buffer 42 or 44 will be formatted, 
30 compressed and then uploaded through the system 20 to the staging server 
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70. The upstream data packets may travel from the network management 
controller 50 across the distribution network 52 to video server 60 through a 
process called IP ("Internet Protocol") tunneling, which is essentially 
automatic IP routing based upon information in the packet payload. The 
5 same process can be used to route packets through network 52 directly to 
staging server 70 without going through video server 60. Figure 4B shows 
that, at video server 60, an L1 pass-through process 63 uses a VSP routing 
table 67 to associate destination IP addresses with corresponding tags 
inserted in the received data packets. This process re-directs the data 

10 packets to the application server 66 L1 pass-through process 63 by 
associating the tags with the appropriate listed destination - here, the 
application server 66. The L1 pass-through process 63 on application server 
66 performs a similar function with the data packets, routing them based on 
a payload identifier (transaction code or other) to an event record capture 

1 5 ("ECAP") open server process 71 on the staging server 70. 

When the ECAP process 71 receives a clickstream data packet, it 
accepts the data packet and correlates the source address of the data 
packet with an upload session already in progress with a particular STB 30. 
If there is currently no upload in progress with that STB 30, then one is 

20 considered to be initiated. ECAP process 71 processes the upload of data 
in accordance with the particular protocol needed for the system 20. After 
receipt of all clickstream data packets associated with the upload from a 
particular STB 30, the ECAP process 71 sequences the packets into proper 
order (particular packets may have arrived out of their original transmission 

25 sequence because of transmission delays in network 52), decompresses the 
packets, eliminates transport overhead (e.g., trailers, headers, etc.) and 
stores them, such as in a flat file, for later analysis. At the end of a selected 
period, like 24 hours, the file is closed and a new one is opened, which 
allows a subsequent merge and parse process to batch process discrete 

30 files that cover discrete time periods. Immediately after initiation of and 
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during the ECAP process 71, an operation log is opened to record 
information about the initiation and termination of each upload session and 
any errors. 

As shown in Figure 5, staging server 70 will formulate and send a 
5 data acknowledgment to each STB 30 engaged in the upload process. One 
method of doing so is to send acknowledgments as addressable 
downstream level one pass-through transactions over network 52 to the 
STB 30. Such data acknowledgments provide redundant error correction 
because failure to receive them may alert STB 30 to a possible transmission 
10 error. 

Merging and Parsing 

Figures 6A and 6B show an overview of the merging and parsing 
process and Figure 7 shows sample results following that process. Briefly, 
the aim of the merge and parse process is to merge each STB 30's event 

15 records with various "metadata." "Metadata" refers to (1) programming of 
virtually any type shown on system 20 including the time and broadcast or 
cable network providing such programming or (2) interactive applications 
invoked by subscribers. For instance, metadata includes the following 
sources of data: EPG broadcast programming schedule data 82, broadcast 

20 advertising schedule data 84, local advertising schedule data or session- 
services advertising schedule data 86 and session-services programming 
schedule data 88. (Session-services advertising refers to advertising 
inserted by video server 60 during particular interactive sessions with the 
subscriber that are the session-services programming). 

25 Collectively, all of this data enters into a merge and parse engine 90 

that creates an event timeline 92 for each STB 30. Merge and parse engine 
90 may be deployed upon staging server 70 or the MKIS system 100. So 
deploying merge and parse engine 90 on staging server 70 allows collected 
event records to be merged and parsed. The resulting event timelines 92 

30 can be sent to MKIS system 1 00 for further analysis. 
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Timeline 92 provides a snapshot of activity on a particular STB 30 for 
a selected period (e.g., 24 hours) or for a selected event for instance, a 
timeline 92 would be created for each STB 30 tuning to a particular show or 
shows (e.g., a pay per view fight) that may occur over a selected period. 
5 Timeline 92 is created by merging event records with metadata about 
programming available over the network for the selected time period. 

To merge that data, proper priority must be assigned to data that 
otherwise may be conflicting. For instance, broadcast advertising data 84 
may indicate that a certain national ad was run at Time A. On the other 

10 hand, if the system 20 is an interactive system and the interactive server 
provided a targeted advertisement ("ad") also at Time A, as indicated by 
session-services advertising data 86, that targeted ad was inserted over the 
national ad at Time A. Thus, by assigning session-services advertising data 
86 a priority higher than national broadcast advertising data 84, the merge 

1 5 and parse engine 90 is able to create an accurate timeline 92 of 

programming delivered to a particular STB 30. Similarly, even a traditional 
cable or wireless cable network requires priority assignments. Typically, 
local cable operators typically are allowed to insert local ads over certain 
national ads (assuming they can sell that local ad time). 

20 Figure 6B depicts such priority assignments. Figure 6B shows 

several sources of data, such as EPG metadata, National and Local Insert 
ad metadata and Interactive Sessions metadata. EPG metadata is usually 
very broad - for instance, showing a football game on channel 1 from 1:00 
to 4:00 p.m. Thus, EPG metadata is assigned a priority lower than that of 

25 national ad metadata because a particular national ad will be overlayed into 
a particular time slot broadly defined by the EPG. In turn, local insert ad 
metadata trumps national ad metadata because the national ad metadata 
may not account for situations where a local network or affiliate inserts a 
local ad over the national ad scheduled for a particular timeslot. Finally, 

30 interactive sessions metadata, which reflects subscriber selections, has the 
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highest priority as it shows the subscriber stopped watching a particular 
channel and instead invoked an interactive session. 

Applying these priority rules produces a timeline 94 for each 
subscriber. Additional filtering criteria 94 are applied by the merge and 
5 parse engine 90 in order to generate a further refined timeline 94, as 

depicted in Figure 6A. For example, event records may include such highly 
granular and specific information as the number of volume ups or channel 
ups that a particular subscriber entered. One set of filtering criteria 94 may 
ensure that the timeline 92 includes only channels that were viewed for 

10 more than a threshold (e.g., 15 seconds) time period. This eliminates any 
very fast channel changes made by the subscribers, thereby simplifying the 
event timeline 92 because event records that do not meet the criteria 94 are 
filtered out of the event timeline 92. 

Merge and parse engine 90 also may apply other criteria to the 

15 filtered timeline 94 (or the original timeline 92), as shown in Figure 6. 

Specifically, advertisers may wish to apply "view" and "watch" criteria 96. 
This criteria 96 will identify those programs and advertisements that are 
"viewed" by subscribers for less than a certain threshold amount of time. 
Programming seen by subscribers for more than that threshold, would be 

20 identified as "watched" programming. For example, for a 30 second ad, the 
threshold might be 15 seconds. If a subscriber was tuned to a channel 
displaying that ad for less than 15 seconds he would be deemed to have 
simply "viewed" that ad; on the other hand, if the subscriber was tuned to 
the channel carrying that ad for 25 seconds of the ad's length, he would be 

25 deemed to have "watched" it. This criteria 96 allows system 20 operators to 
charge more for "watched" ads versus those that are merely "viewed." 
Similar criteria can be applied against programming in order to more 
accurately gauge ratings. Thus, for a 30 minute program, if a user was 
tuned to that program for less than 10 minutes, the view and watch criteria 

30 96 may decide that the program was only "viewed." In any event, applying 
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the view and watch criteria 96, merge and parse engine 90 creates "view" 
and "watch" lists 98 that are useful for the system 20 operator and 
advertisers who contract with system 20 operator. 

Note also that other criteria than simply how much time a tuned to a 
5 particular channel may be included in the view and watch criteria 96. For 
instance another criteria may be volume level. If a viewer was tuned to a 
channel for the full thirty second length of an ad but hit the mute button or 
changed the volume below a certain threshold for that ad ( view and watch 
criteria 96 may classify that ad as a "viewed" ad. 

10 Generally, merging and parsing should be done on discrete segments 

of data, such as 24 hour segments, as soon as possible in order to minimize 
the occurrence of un-resolved events. In other words, discrete events are 
simply pieces of the entire picture. To analyze only several hours of 
clickstream event data would not allow determination of such things as 

15 programming "watched" versus "viewed." 

Figure 7 shows a sample merge of event records or clickstream data 
80, EPG data 82 from Prevue or a similar service and broadcast advertising 
data 84 that creates a clickstream timeline 92, which shows both the 
channels selected by a subscriber and the content displayed on those 

20 channels while the subscriber watched them. 

A timeline 94 for each STB 30 is built and uploaded by staging server 
70 to the MKIS database 100 or a third party analysis engine and database 
1 1 0, either of which may store demographics and be used to run queries 
against the event timelines 94 and those demographics. Combining the 

25 timelines 94 with demographics information allows for even more detailed 
and granular information about subscribers and their viewing habits. For 
instance, consider the following examples: 

• Example 1: Widget Co. has ten different advertisements that it 
has been running on system 20, Widget Co. wishes to know 

30 whether subscribers are "viewing" or "watching" particular ads. 
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Because of the detailed information captured by the system 20 of 
the present invention, a query can be formulated to determine (a) 
which subscribers "watched" particular 30 second advertisements 
for greater than 15 seconds versus (b) which subscribers simply 
5 "viewed" the ad, for less than 15 seconds. 

• Example 2: When event timelines 94 (or view and watch lists 98) 
are loaded into MKIS 100 or analysis engine 110, the same query 
can be run for a particular demographic group. For instance, 
Widget Co. wishes to know which particular ads its primary 
10 customer base, baby boomers between ages 40 and 50 and with 

income over $50,000 per year, "watched" versus "viewed" their 
advertisements. 
Obviously, the system 20 can also be modified to target ads to 
particular demographic households based on feedback from parsed and 
15 merged data. Then, event records occurring after those targeted ads are 
broadcast over system 20 can be checked to determine whether the 
particular demographic market targeted watched or viewed the 
advertisement. 

The foregoing is provided to explain and disclose preferred 
20 embodiments of the present invention, modifications to which may be made 
that still fall within the following claims. For instance, the architecture and 
programming of the system may be modified. Or, a variety of different 
manufacturers' servers, set top boxes or databases may be configured in 
order to implement the system. The particular identification codes and 
25 allocated sizes show in the tables and described herein may also be greatly 
modified. Further modifications and adaptations to the described 
embodiments will be apparent to those skilled in the art and may be made 
without departing from the scope or spirit of the invention and the following 
claims. 
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11. A method for collecting information about viewing habits of 

2 subscribers to a media delivery network for delivering programming to 

3 numerous set top boxes, each capable of supporting different applications 

4 invoked and controlled by subscriber commands, the method comprising the 

5 steps of: 

6 a) programming each application to identify selected 

7 subscriber commands of interest; 

8 b) determining an application identifier corresponding to a 

9 particular application to which a selected command is 

10 addressed; and 

11 c) creating an event record comprising: 

12 (1 ) the application identifier; 

1 3 (2) an identification code corresponding to the 

14 selected command; and 

15 (3) a time stamp. 

1 2. A method according to claim 1 further comprising the step of 

2 accessing a table in order to determine the identification code for the 

3 selected command. 

1 3. A method according to claim 2 further comprising the step of 

2 accessing a table in order to determine the application identifier. 

1 4. A method according to claim 2 further comprising the steps of 

2 repeating steps a through c to collect a plurality of event records and 

3 buffering the plurality of event records. 



1 
2 



5. A method according to claim 4 further comprising the step of 
forwarding the plurality of event records to a merge processor. 
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1 6. A method according to claim 5 further comprising the step of 

2 coupling to the merge processor a data source selected from the group 

3 consisting of: broadcast identification information, interactive application use 

4 information, national advertising information and local advertising 

5 information. 



1 7. The method according to claim 1 in which the selected 

2 commands of interest are selected from the group consisting of: channel 

3 change commands, volume change commands, VCR commands, 

4 application invocation commands and application control commands. 

1 8. A system for collecting and processing information about 

2 subscribers 1 selection and use of programming distributed over a media 

3 delivery network, the system comprising: 

4 a) a merge processor coupled via means for communication to 

5 b) a plurality of set top boxes, each comprising a 

6 processor for (1) collecting a plurality of event records 

7 that describe selected commands from a subscriber to a 

8 particular set top box and (2) transmitting event records 

9 to the merge processor; 

10' c) wherein the merge processor forms an event timeline 

1 1 describing a subscriber's selection of distributed 

12 programming for a discrete time period by merging the 

1 3 event records with programming data describing 

14 programming available via the media delivery system. 

1 9. A system according to claim 8 wherein the programming data 

2 comprises data collected from at least two sources selected from the group 

3 consisting of: a broadcasting schedule source, a national advertising 
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4 schedule source, a local advertising schedule source and an interactive 

5 application use schedule source. 

1 1 0. A system according to claim 8 wherein each set top box further 

2 comprises a plurality of applications capable of being invoked by a 

3 subscriber. 

1 1 1 . A system according to claim 1 0 wherein each event record 

2 comprises: (1) an application identifier corresponding to the application 

3 associated with the recorded event; (2) an event identification code; and (3) 

4 a time stamp associated with the initiation of the event. 

1 12. A system according to claim 1 1 wherein each application 

2 creates an event record upon detection of selected commands from the 

3 subscriber. 

1 1 3. A system according to claim 8 further comprising a buffer for 

2 storing the event records before transmission. 

1 14. A system according to claim 8 wherein the merge processor 

2 forms an event timeline for each of the plurality of set top boxes. 

1 1 5. A system according to claim 14 further comprising an analysis 

2 engine for correlating the event timelines with demographics information 

3 describing the subscribers. 

1 16. A method for journaling information about subscriber use of a 

2 media delivery network for delivering programming and a merge processor 

3 for analyzing the resulting journaled information, the method comprising the 

4 steps of: 



-37- 



4 



> 



WO 98/31114 PCT/US98/00091 

5 a) collecting information about a plurality of subscribers' 

6 use of a media delivery network, the collecting step 

7 comprising: 

8 i) identifying commands of interest from each 

9 subscriber; 

1 0 ii) forming event records that record at least the 

1 1 commands of interest and a time associated with 

12 the command; 

13 b) transmitting event records to the merge processor; 

14 c) merging the event records with data describing the 

1 5 programming delivered over the media network in order 

16 to form event timelines, each of which describes the 

1 7 programming selected by a particular subscriber over a 

1 8 discrete time period. 

1 17. A method according to claim 16 wherein the identifying step 

2 comprises the step of correlating each command of interest with a global 

3 table comprising identification codes. 

1 18. A method according to claim 16 further comprising the step of 

2 filtering the event timelines in order to classify subscribers 1 viewing patterns 

3 into at least two categories. 

1 1 9. A method according to claim 1 8 wherein the first category 

2 comprises programming watched by a subscriber for greater than a selected 

3 threshold percent of the total program length. 

1 20. A system for determining the viewing habits of subscribers to a 

2 media delivery network for delivering programming, the system comprising: 
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3 a) a collector for collecting event records describing subscribers' 

4 selection and use of programming; 

5 b) means, coupled to the collector, for communicating event 

6 records to 

7 c) a merge processor for processing the event records to form for 

8 a selected subscriber an event timeline describing the 

9 programming delivered to a selected subscriber over a 

10 particular time period via the media delivery network; 

1 1 d) means for storing demographics information about selected 

12 groups of subscribers; and 

1 3 e) wherein the merge processor forms a plurality of event 

14 timelines and correlates the demographics information with the 

15 event timelines. 

1 21 . A system according to claim 20 in which the merge processor 

2 applies filtering criteria to the event records to determine the programming 

3 watched by a subscriber for greater than a selected percent of the 

4 programming. 

1 22. A system according to claim 21 in which the collector is 

2 deployed upon a set top box that is associated with a display device for 

3 displaying delivered programming. 

1 23. A system according to claim 22 in which the subscriber controls 

2 the set top box via a remote device in order to invoke and run a variety of 

3 applications and the collector forms event records by: 

4 a) identifying a code that corresponds to a command of 

5 interest entered by a selected subscriber; and 

6 b) storing in a buffer, associated with the collector, an 

7 event record comprising (1) the code corresponding to 

8 the command; and (2) a time stamp. 
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